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Abstract 

Let r, s > 2 be integers. Suppose that the number of blue r-cliques in a red/blue coloring of the 
edges of the complete graph K n is known and fixed. What is the largest possible number of red 
s-cliques under this assumption? The well known Kruskal-Katona theorem answers this question 
for r = 2 or s = 2. Using the shifting technique from extremal set theory together with some 
analytical arguments, we resolve this problem in general and prove that in the extremal coloring 
either the blue edges or the red edges form a clique. 

1 Introduction 

As usual we denote by K s the complete graph on s vertices and by K s its complement, the edgeless 
graph on s vertices. By the celebrated Ramsey's theorem, for every two integers r, s every sufficiently 
large graph must contain K r or K s . Turan's theorem can be viewed as a quantitative version of the 
case s = 2. Namely, it shows that among all K r -iree n-vertex graphs, the graph with the least number 
of Ki (edges) is a disjoint union of r — 1 cliques of nearly equal size. More generally, one can ask the 
following question. Fix two graphs H± and H2, and suppose that we know the number of induced 
copies of Hi in an ra-vertex graph G. What is the maximum (or minimum) number of induced copies 
of Hi in G? In its full generality, this problem seems currently out of reach, but some special cases 
already have important implications in combinatorics, as well as other branches of mathematics and 
computer science. 

To state these classical results, we introduce some notation. Adjacency between vertices u and v 
is denoted by u ~ v, and the neighbor set of v is denoted by N(v). If necessary, we add a subscript G 
to indicate the relevant graph. The collection of induced copies of a fc-vertex graph H in an n-vertex 
graph G is denoted by \nd{H; G), i.e. 

Ind(#; G) := {X C V(G) : G[X] ~ H} 
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and the induced H -density is denned as 



Ik) 

In this language, Turan's theorem says that if d(K r ; G) = then d(K 2 ; G) < 1 - ^ and this bound 
is tight. For a general graph H, Erdos and Stone [5] determined maxe^i^; G) when d(H; G) = and 
showed that the answer depends only on the chromatic number of H. Zykov |17j extended Turan's 
theorem in a different direction. Given integers 2 < r < s, he proved that if d(K s ;G) = then 
d(K r ;G) < ^ s ~^2i)~ rS> ■ The balanced complete (s — l)-partite graphs shows that this bound is also 
tight. 

For fixed integers r < s, the Kruskal-Katona theorem [8j [10] states that if d{K r \ G) = a then 
gain, the bound is tight and is attained when G is a clique on some subset of 
the vertices. On the other hand, the problem of minimizing d(K s ;G) under the same assumption 
is much more difficult. Even the case r = 2 and s = 3 has remained unsolved for many years 
until it was recently answered by Razborov |13| using his newly-developed flag algebra method. 
Subsequently, Nikiforov pj] and Reiher [H] applied complicated analytical techniques to solve the 
cases (r, s) = (2,4), and (r = 2, arbitrary s), respectively. 

In this paper, we study the following natural analogue of the Kruskal-Katona theorem. Given 
d(K r ; G), how large can d(K s ; G) be? For integers a > b > we let Q a f, be the a- vertex graph whose 
edge set is a clique on some b vertices. The complement of this graph is denoted by Q ab . Let Q a 
denote the family of all graphs Q a ,b an d its complement Q a b for < b < a. Note that for r = 2 
or s = 2, the Kruskal-Katona theorem implies that the extremal graph comes from Q n . Our first 
theorem shows that a similar statement holds for all r and s. 

Theorem 1.1. Let r, s > 2 be integers and suppose that d(K r ; G) > p where G is an n-vertex graph 
and < p < 1. Let q be the unique root of q r +rq r ~ 1 (l — q) = p in [0, 1]. Then d(K s ; G) < M rjSi p+o(l), 
where 

M r , s , p := max{(l - p l l r ) s + sp l l r {\ - p 1 ^)- 1 , (1 - q) s }. 

Namely, given d(K r ;G), the maximum ofd(K s ;G) (up to ±o n (l)) is attained in one of two graphs, 
(or both), one of the form Q n ,t and another Q nt r. 

We obtain as well a stability version of Theorem Two n-vertex graphs H and G are e-close 
if it is possible to obtain H from G by adding or deleting at most en 2 edges. As the next theorem 
shows, every near-extremal graph G for Theorem |1.1| is e-close to a specific member of Q n . 

Theorem 1.2. Let r, s > 2 be integers and let p G [0,1]. For every e > 0, there exists 5 > and 
an integer N such that every n-vertex graph G with n > N satisfying d(K r ; G) > p and \d(K s ; G) — 
M r , s ,p\ < 5; is e-close to some graph in Q n . 

Rather than talking about an ra-vertex graph and its complement, we can consider a two-edge- 
coloring of K n . A quantitative version of Ramsey Theorem asks for the minimum number of 
monochromatic s-cliques over all such colorings. Goodman |7j showed that for r = s = 3, the opti- 
mal answer is essentially given by a random two-coloring of E{K n ). In other words, min^j d(K^; G) + 
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Figure 1: Illustration for the case r = s = 3. The green curve is (d^K^; Q n fln)i d(K^; Qnfin)) f° r 
9 G [0,1], and the red curve defined the same with Q n g n . The maximum between the curves is 
the extremal function in Theorem |1.1[ The intersection of the curves represents the solution of the 
max-min problem in Theorem |1.3| 

d(K^; G) = 1/4— o(l). Erdds [I] conjectured that the same random coloring also minimizes d(K r ; G) + 
d(K r ;G) for all r, but this was refuted by Thomason |15j for all r > 4. A simple consequence of 
Goodman's inequality is that mine max{d(K^; G), d(K^; G)} = 1/8. The following construction by 
Franek and Rodl [18] shows that the analogous statement for r > 4 is again false. Let H be a graph 
with vertex set [2] 13 , the collection of all 8192 binary vectors of length 13. Two vertices are adjacent if 
the Hamming distance between the corresponding binary vectors is a number in {1,4, 5, 8, 9, 11}. Let 
G be obtained from H by replacing each vertex with a clique of size n, and every edge with a complete 
bipartite graph. The number of and K4 in G can be easily expressed in terms of the parameters 
of H (see [IE]), for large enough n one can show that d{K±\ G) < 0.99 • ^ and d(K4] G) < 0.993 • gj. 

While the min-max question remains at present very poorly understood, we succeeded to com- 
pletely answer the max-min version of this problem. 

Theorem 1.3. 

maxmin{d(if r ; G),d(K r ; G)} = p r + o(l), 

G 

where p is the unique root in [0, 1] of the equation p r = (1 — p) r + rp(l — p) r ~ l . 

This theorem follows easily from Theorem |l.l[ Moreover, using Theorem |l,2[ we can also show that 
for every e > there is a 5 > such that every n-vertex graph G with min{ d(K r ; G), d(K r ; G)} > p r —5 
is e-close to a clique of size pn or to the complement of this graph. 

Here we prove these theorems using the method of shifting. In the next section we describe this 
well-known and useful technique in extremal set theory. Using shifting, we show how to reduce the 
problem to threshold graphs. Section [3] contains the proof of our main result for threshold graphs 
and section [4] contains the proof of the stability result. In Section [5] we sketch a second proof for the 
case r = s, based on a different representation of threshold graphs. We make a number of comments 



3 



on the analogous problems for hypergraphs in Section [6} We finish this paper with some concluding 
remarks and open problems. 



2 Shifting 



Shifting is one of the most important and widely-used tools in extremal set theory. This method 
allows one to reduce many extremal problems to more structured instances which are usually easier 
to analyze. Our treatment is rather shallow and we refer the reader to Frankl's survey article [6] for 
a fuller account. 

Let J- be a family of subsets of a finite set V, and let u, v be two distinct elements of V. We 
define the (u,v) -shift map S U —} V as follows! for every F G J~ , let 



We define the (u, w)-shift of F, to be the following family of subsets of V: S U -> V {F) := {S U ^ V (F,F) : 
F G J 7 }. We observe that \S u ^ v (J-)\ = \J~\- In this context, one may think of J 7 as a hypergraph over 
V. When all sets in T have cardinality 2 this is a graph with vertex set V. As the next lemma shows, 
shifting of graph does not reduce the number of /-cliques in it for every I. Recall that Ind(Ki;G) 
denotes the collection of all cliques of size I in G. 

Lemma 2.1. For every integer I > 0, every graph G, and every u ^ v G V(G) there holds 



Proof. Let A = S U ^ V (B,G), where B is an /-clique in G. First, consider the cases when u ^ B or 
both u, v G B or B \ {u} U {v} is also a clique in G. Then A = B and we need to show that B remains 
a clique after shifting. Which edge in B can be lost by shifting? It must be some edge uw in B that 
gets replaced by the non-edge vw (otherwise we can not shift uw). Note that vw is not in B, since 
B is a clique. Hence u,w G B and v G" B. But then B \ {u} U {v} is not a clique, contrary to our 
assumption. 

In the remaining case when u G B, v £ B and B \ {u} U {v} is not a clique in G, we need to show 
that A = B \ {u} U {v} is a clique after shifting S U -> V (G). Every pair of vertices in A \ {v} belongs to 
B and the edge they span is not affected by the shifting. So consider v ^ w G A. If vw G E(G), this 
edge remains after shifting. If, however, vw £ E(G), note that uw G E(G) since both vertices belong 
to the clique B. In this case vw = S u -> v (uw, G) and the claim is proved. □ 

Since shifting edges from u to v is equivalent to shifting non-edges from v to u, it is immediate 
that S u ^ v (Ind(Ki; G)) C lnd(Kf, S U -> V (G)). Therefore we obtain the following corollary. 

Corollary 2.2. Let G be a graph, let H = S U ^ V (G) and let I be a positive integer. Then 




(F U {v}) \{u} ifuGF.^F and (F U {v}) \ {u} G" J 7 , 



otherwise. 



S u ^ v (Ind(K i; G)) C Ind(K;; 5 U _>„(G)). 



d(Kf,H)>d(Ki;G) and d(Ki; H) > d(Ki; G). 
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We say that vertex u dominates vertex v if S V ^ U (J-) = T . In the case when J 7 is a set of edges 
of G, this implies that every w ^ u which is adjacent to v is also adjacent to u. If V = [n], we 
say that a family J- is shifted if i dominates j for every i < j. Every family can be made shifted by 
repeated applications of shifting operations Sj-n with i < j. To see this note that a shifting operation 
that changes T reduces the following non- negative potential function YIagJ 7 ^Zi&A As Corollary 



2.2 



shows, it suffices to prove Theorem 1.1 for shifted graphs. 

In Section [3] we use the notion of threshold graphs. There are several equivalent ways to define 
threshold graph (see [2]), and we adopt the following definition. 

Definition 2.3. We say that G = (V,E) is a threshold graph if there is an ordering of V so that 
every vertex is adjacent to either all or none of the preceding vertices. 

Lemma 2.4. A graph is shifted if and only if it is a threshold graph. 

Proof. Let G be a shifted graph. We may assume that V = [n], and i dominates j in G for every 
i < j. Consider the following order of vertices, 

3, N G (2)\N G (3), 2, iV G (l)\JV G (2), 1, V\N G (1) , 

where the vertices inside the sets that appear here are ordered arbitrarily. We claim that this order 



satisfies Definition 2.3 First, every vertex v ^ Nq(1) is isolated. Indeed, if u ~ v, then necessarily 
v ~ 1, since 1 dominates u. Therefore, vertex 1 and its non- neighbors satisfy the condition in the 
definition. The proof that G is threshold proceeds by induction applied to G[iV G (l)]. 



Conversely, let G be a threshold graph. Let v%, V2, ■ • • , v n be an ordering of V as in Definition 2.3 
We say that a vertex is good (resp. bad) if it is adjacent to all (none) of its preceding vertices. 
Consider two vertices V{ and Vj. It is straightforward to show that V{ dominates Vj if either (1) Vi is 
good and Vj is bad, (2) they are both good and i > j or (3) they are both bad and i < j. Therefore 
we can reorder the vertices by first placing the good vertices in reverse order followed by the bad 
vertices in the regular order. This new ordering demonstrates that G is shifted. □ 



3 Main result 

In this section, we prove Theorem |1.1| It will be convenient to reformulate the theorem, in a way 
that is analogous to the Kruskal-Katona theorem. 

Theorem 3.1. Let r, s > 3 be integers and let a,b > be real numbers. The maximum (up to 
±o n (l) / ) of the function f{G) := min{a • d(K s ;G),b ■ d(K r ;G)} over all n-vertex graphs is attained 
in one of two graphs, (or both), one of the form Q n j and another Q nt i. In particular, f(G) < 
max{a • a s , b ■ /3 r } + o(l), where a is the unique root in [0, 1] of a ■ a s = b ■ [(1 — a) r + ra(l — a) r ~ l \ 
and (3 is the unique root in [0, 1] of b ■ (3 r = a ■ [(1 — (3) s + sf3(l — /3) s_1 ]. 



We turn to show how to deduce Theorem 1 1 . 1 1 from Theorem 3.1 We assume that r, s > 3, since 
the other cases follow from Kruskal-Katona theorem. 
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Proof of Theorem \l . 1\ Let M be the maximum of d{K s ;G) over all graphs G on n vertices with 
d(K r ;G) > p. Fix such an extremal G with d(K r ;G) = p' > p and d{K s ;G) = M. Now apply 
Theorem 3.1 with a = p and b = M and the same n, r and s. The extremal graph G' that Theorem 



3.1 yields, satisfies 



/(C) > f(G) = mm{a ■ d(K s ; G), b ■ d(K r ; G)} = p ■ M, 



1.1 



as well 



hence d(K s ; G') > M and d(K r ; G') > p. Therefore, the same G' is extremal for Theorem 
and we know that the maximum in this theorem is achieved asymptotically by a graph of Qn- 

Note that we can always assume that in the extremal graph d(K r ; G') = p since otherwise we can 
add edges to G' without decreasing d(K s ; G') until d(K r ; G') = p is obtained. Therefore the maximum 
is attained either by a graph of the form Q np i/r n or by Q n> (i-q) n , where q r + rq r ~ 1 (l — q) = p. This 
implies that asymptotically the maximum in Theorem |1.1| is indeed 



M T)S)P = max{(l -p 1 ' 7 ') 3 + sp l/r {\ -p 



X/r\s— 1 



(1 -5)'}- 



□ 



By Corollary 2.2 and Lemma 2.4, f{G) is maximized by a threshold graph. We turn to prove 
Theorem |3.1| for threshold graphs. Let G be a threshold graph on an ordered vertex set V, as in 



Definition 2.3 There exists an integer k > 0, and a partition A\, . . . , A2k of V such that 

1. If v G Ai and u G Aj for z < j, then v < u. 

2. Every vertex in A21-1 (respectively A21) is adjacent to all (none) of its preceding vertices. 
Let Xi = and = ^yp- Clearly X^i=i( a; * + = 1- Up to a negligible error-term, 

r / \ s-1- 

fc-1 



d(^;G)=p(x,y):= 



vi=l 
' k 



i=l 
k 




d(K r ;G) = q(x,y):= +r-J] 



i=i 



r-l- 



X7' 



J=i 



Where x = (xi, X2, • • • , Xk) and y = (yi, y%, . . . , yk)- Occasionally, p will be denoted by p s and q by 
q r to specify the parameter of these functions. 

Our problem can therefore be reformulated as follows. For given integers k > 2, r, s > 3 and real 
a, b > 0, let W fc C M 2fc be the set 



(xi,x 2 , • . .,x k ,yx,y 2 , ...,y k )€R : x 



k >> 

i,yi>0 for all i and ^(x, + y 4 ) = 1 > 
i=i J 



Let p, q : W k — > K be the two homogeneous polynomials defined above, We are interested in maxi- 
mizing the real function 

</?(x, y) := mimja • p(x, y), 6 • g(x, y)}. 
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This problem is well defined since Wk is compact and ip is continuous. 

We say that (x, y) G Wk is non- degenerate if the set of zeros in the sequence (yi, 22, 2/2, • • • , ^fc, 2/fc), 
with xi omitted, forms a suffix. If (x,y) G Wfc is degenerate, then there is a non-degenerate (x',y') € 
VFfc with p(x,y) = p(x',y'). Indeed, if m = and 7^ for some 1 < i < k, let (x',y') G VFfc-i 
be defined by 

X — (iEl) . . . j Xi—l , Xj + 5 35 j+2 , • • • ; ^fc) 

y' = (yi,---,yi-i,Ui+i,---,yk) 

It is easy to verify that p(x, y) = p(x',y') and q(x, y) = (/(x'jy'). By induction on k, we assume 
that (x',y') is non-degenerate, and by padding x' and y' with a zero, the claim is proved. The case 
Xi = and yi 7^ is proved similarly. In particular, ip has a non-degenerate maximum in Wk- 

Our purpose is to show that the original problem is optimized by graphs from Q n . This translates 
to the claim that a non-degenerate (x, y) that maximizes (p is supported only on either xi, y\ or 2/1, ^2, 
which corresponds to either a clique Q n ^ or a complement of a clique Q n j, respectively. 

Lemma 3.2. Let (x, y) 6 Wk be a non-degenerate maximum of p. If x\ > 0, then for every i>2, 
x i = Vi = 0. On the other hand, if x\ = then yi = for every i > 2, and Xi = for every i>2>. 

Proof. We note first that the second part of the lemma is implied by the first part. Define x' by 



x' 



Xi + \ if i < k, 
if i = k. 



Clearly, if x 1 = 0, then p s (x, y) = q s (y, x'), q r (x, y) = p r (y, x'), and 

^'(y, x/ ) := mm{b-pr(y,yi'),a ■ g s (y,x)} = ¥?(x,y). 

Since ip attains its maximum when x\ = 0, maximizing it is equivalent to maximizing (p'(y, x'). Since 
(x,y) is non-degenerate, y\ > 0, and applying the first part of Lemma 3.2 for 99'(y,x') finishes the 
proof, by obtaining that for every i > 2, yi = x\ = 0. 

The first part of Lemma 3.2 is proved in the following lemmas. We successively show that X3 = 0, 
then ?/2 = and finally X2 = 0. 

□ 

Here is a local condition that maximum points of <p satisfy. 

Lemma 3.3. If p takes its maximum at a non- degenerate (x, y) 6 Wk, then a -p(x,y) = b- g(x, y). 

Proof. Note that < y\ < 1, since (x, y) G W is non-degenerate. We consider two perturbations of 
the input, one of which increases p(x, y), and the other increases g(x, y). Consequently, if a-p(x, y) 7^ 
b- <?(x, y), by applying the appropriate perturbation, we increase the smallest between a -p(x,y) and 
b ■ q(x, y), thus increasing min{a • p(x, y), b ■ q(x, y)}, contrary to the maximality assumption. 

To define the perturbation that increases p, let x' = x + iei and y' = y — iei , where < t < y\ , 
and ei is the first unit vector in Then, (x',y') G W and 



dp(x',y') 



s-l / k x -I 
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as claimed. 

In order to increase q, consider two cases. If x\ = 0, ley x' = x — te? and y' = y + tei, where 
< t < x 2 . Then, (x',y') G W and 

^-'( t+ s-r-(§»r >a 

If x\ > 0, we let x' = x — tei and y' = y + tei, where < t < X\. Then, 

□ 



Lemma 3.4. If (x, y) G Wk is a non- degenerate maximum of ip with x\ > 0, then x% = 0. 
Proof. Suppose, that X3 > and let 1 < I < m < k. Then 

c _i r / \ s—2- 

s 1 l-l 

+ 8(8-1) ^ 

r-1 



dp 
dxi 



dq 
dxi 



r ■ 




i=i 



Vi 



E ■ 

U=i+1 



and 



d 2 p 



s-2 



l-l 



dxidx m 

d 2 q 
dxidxm 



= 5(5-1)- J> +5(5-l)(5-2)-^ 



,i=l 



i=l 



E ; 

v j=j+l 



= 0. 



dearly ^ = 


8 2 p 


for / < m. 




" 1 


1 


1 " 


A = 


dp 


dp 


dp 




dxi 


d S 2 


8x3 




dq 


8q 


8q_ 




-dxi 


8x2 


8x3 _ 


It is easy to see that if (x, y 


) is I 


that B is positive definite. 




For a vector v G IE 


I 3 and 


e > 



d 2 P 


d 2 p 


8 2 p 1 


dxf 


dxidx2 


8x18x3 


d 2 p 


d 2 p 


8 2 p 


8x18x2 


dx\ 


8x28x3 


d 2 p 


d 2 p 


8 2 p 


8x18x3 


8x28x3 


8x 2 . 



s-3- 



8 2 p 
8x 2 



~8 2 p 8 2 p 8 2 p' 

dxf dxf dxf 

8 2 p 8 2 p 8 2 p 

dxf dx 2 dx?, 

8 2 p 8 2 p 8 2 p 

_8x\ 8x\ 8x\. 

d 2 p ^ 8 2 p ^ n 



X 4 = 



xi + evi 

Xj, 



if i < 3, 
if i > 3, 
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If A is invertible, let v be the (unique) vector for which 



A-v J 



In particular x\ = Xj. For e sufficiently small, 

p(x', y) = p(x, y) + e + 0(e 2 ) > p(x, y) 

g(x', y) = g(x, y) + e > g(x, y) 

contrary to the maximality of (x,y). 

If A is singular, pick some v / with A • v T = 0. Again Yli x i = Since B is positive 

definite, for a sufficiently small e, 

6 2 

p(x', y) = p(x, y) + - • v • B • v T + 0(e 3 ) > p(x, y) 



q{x',y) = ?(x,y), 

Contradicting Lemma |3.3| 

Lemma 3.5. // (x, y) £ is a non- degenerate maximum of ip with x\ > 0, i/ien y 2 = 0. 
Proof. Suppose, towards contradiction, that y 2 ^ 0. Let 



M 



h b 2 



where 



dp dp s _ 2 
ai = dx- 1 -dx-2 = ~ S(S ~ 1) ' yi ' X2 ' 



(>2 



dp dp 
dyi dy 2 



s ■ x 



-l 

2 ' 



dq dq 

dx\ dx 2 

dq dq 

dyi dy 2 



-r(r -1) ■ x 2 -y 2 



V2 



r-2 



□ 



If rank(M) = 2, then there is a vector v = y^J such that M • v = (J). Define x\ = x\ + evi,x' 2 = 
X2 — ev\ and y[ =yi + ev2, y' 2 = y 2 — ev 2 . Then x[ + x 2 + y[ + y' 2 = 1 and for sufficiently small e > 

i ' '\ i \ , ( d P d P i dp dp \ 2 

p(x,y) = p(x,y) + w m -—^j+OCe) 

= p(x, y) + e(ai«i + a 2 -u 2 ) + 0(e 2 ) = p(x, y) + e + <3(e 2 ) > p(x, y). 

Similarly q(x',y') = q(x, y) + e + 0(e 2 ) > g(x, y). Thus (x, y) cannot be a maximum of p. Hence, 
rank(M) < 1, and in particular 

oi 6i 
a 2 ^2 



det 



0, 
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which implies that 



= x*- 1 y r 2 - 1 ((r -!)(*- 1 )|-(| + 1 ) r l + 1 



The function 

ff (o;) = (r - l)(s - l)a - (a + l) r_1 + 1 

is strictly concave for a > and vanishes at 0. Since a = is not a maximum of 5, the equation 
g (jj^ = determines ^ uniquely. 



Denote a = — , and consider the following change of variables. 



x l + , 7VT_ 7V~ " x 2, ^2 = -, , 7~_ TV~ ' x 2 



(r - l)(s - l)a 

l + (r-l)(s-l)a ~ z 1 + (r - 1)0 - l)a 

y[ = yi + 2/2 = (a + 1)2/2, 2/2 = 



Clearly, x' x + x' 2 = x\ + X2 and + 2/2 =2/1+2/2- Moreover, 
y') = (2/'i) r + r • x[ ■ (yiY' 1 

/ , \r , ( , \r-l , r -X 2 • (2/1 +2/2) r ~ 1 

= (2/1 + 2/2) + r ■ xi ■ (yi + V2) + 



1 + (r - l)(s - l)a 
, r • (1 + a)"- 1 ■ x 2 • y^ 1 
(1 + a) r 

y') = 0'i + x' 2 ) s + s • yi • (x2) s_1 



(2/1 + 2/ 2 ) r + r • Xl ■ ( yi + ^r 1 + 1 n , „, r -i 2 = ?( x > y) 



= (x 1+ x 2 r + g .(« + l).( i | r -^-^ )a ) S l .y 2 -x^ 
> (xi + x 2 ) s + s ■ a ■ y 2 ■ x s 2 ~ l = p(x, y), 



Where the last inequality is a consequence of Lemma |3.6| below. This contradicts Lemma 3.3 □ 
Lemma 3.6. Let r, s > 3 be integers. Let a > be the unique positive root of 

(a + If" 1 - 1 = (r - l)(s - l)a. 

Then 

1 y- 1 l 



(r — l)(s — l)a) ^ a 

Proof. First, we show that (r — l)a > 1. Let t = (r — l)a and assume, by contradiction, that 
t < 1. For < t < 1, we have e* < 1 + 2i. On the other hand, e > (1 + a) 1//Q , implying e* > 
(1 + a) i//Q = (1 + a) r_1 . Thus we have 2t > (1 + a) r_1 — 1 = (r — l)(s — l)a = (s — l)i, which implies 
2 > s — 1, a contradiction. Therefore (r — l)a > 1. Also, since 1 + x < e x for all x > 0, we have that 
(l + ( r _i)( 1 s _i) a ) S 1 < e^- 1 )" . So it suffices to show that e^- 1 )" < 1 + 1. But since (r — l)a > 1, we 
have 

1 + - >1 + - ^ = r >3>e, 

a J a 

which finishes the proof of the lemma. □ 
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Lemma 3.7. If (x, y) € is a non- degenerate maximum of cp with x\ > ; then X2 = 0. 



Proof. This proof is very similar to the proof of Lemma 3.5 Now x\,X2, y\ > and x\-\- X2 + y\ = 1. 
Also 



p(x, y) = (xi + x 2 ) s + s-yi- x s 2 
g(x,y) =y[+r-x l -yl' 1 . 



s-l 



Let 

where 
ai 

0,2 



dp dp 

dx\ dx 2 

dp dp 

dy\ dx\ 



M 



ai a 2 
h b 2 



-s(s - 1) • yi ■ x s 2 



-s • ((xi +x 2 y 



x 2 )' 



b 2 



dq _ dq_ _ r r -\ 
dx± dx2 ^ 1 



dq dq 

dyi dxi 



r(r — 1) ■ x\ ■ y\ 



r-2 



If M is nonsingular, then there is a vector v = ( ^ J such that M • v = (J). Define x[ = x\ + e{v\ 
V2), x' 2 = X2 — ev\ and y[ = y% + et>2- Then x' x + x' 2 + y[ = I and for sufficiently small e > 



(dp cfp cfp \ 

^ (ui -" 2) -^ 1 + ^ 2 ) + ° (e2) 

= p(x, y) + e(aii>i + a 2 v 2 ) + 0(e 2 ) = p(x, y) + e + 0(e 2 ) > p(x, y). 

Similarly q(-x.',y') = g(x, y) + e + 0(e 2 ) > g(x, y) and therefore (x, y) cannot be a maximum of ip. 
Hence, 

ai b 1 
a 2 b 2 



dp dp 



det 



0, 



which implies 



= y{- 1 x s 2 - 1 (r-l).(s-l).^- P+l 



s-l 



+ 1 • 



Let 7 = — > 0. Then 1 + (r — l)(s — 1)7 — (1 + 7) s =0 and concavity of the left hand side shows 
that 7 is determined uniquely by this equation. Now make the following substitution: 

x[ = 

X 2 = X\ + X2 = (1 + 7) • X2 
1 



y'i 



y'2 



l + (r-l)(«-l)7 

(r-l)(s-l) 7 
l + (r-l)(s-l) 7 
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Clearly x' t + x' 2 = %\ + xi and y' x + y 2 = y\. Moreover 



p( X ',y') = (x' 2 r + s-y' l -(x' 2 y- 1 



(xi + x 2 ) s + s-yx-x s 2 1 = p(x, y 



(x, y') = (yi + y 2 ) r + r ■ x 2 ■ (y'2 



1 \r—l 



y\+r ■ 



(1 + 7) 



l)(s-l) 7 



\l + (r-l)(s-l)7 

> y{ + r • xi ■ y'^ 1 



r-1 



r-1 

■xi-y 1 



r 1 r— 1 / \ 



Where the last inequality follows from Lemma 3.6, with r and s switched. Again, this contradicts 
Lemma 13.31 □ 



By combining Lemmas 3.3 — 13.7[ we obtain a proof of Lemma 3.2 , which states that the maximum 
of (f is attained by a non-degenerate (x, y) supported only on either x\, y\ or yi,X2- In the first case, let 
x\ = a and y± = l — a. Then by Lemma 3.3 a-p(x, y) = a-a s = 6-q , (x, y) = b[(l — a) r + ra(l — a) r_1 ] 



and y?(x, y) = a ■ a s . In the second case, let y\ = (3 and X2 = 1 — /3. Then b ■ g(x, y) = b ■ j3 r = 
a ■ p(x, y) = a[(l — (3) s + s(l — /3) s_1 ] and </?(x, y) = b ■ f3 r . This shows that the maximum of tp is 
max{a • a s , b ■ (3 r } with a, /3 satisfying the above equations. In terms of the original graph, this proves 
that ip is maximized by a graph of the form Q n ^t or Q n t , respectively. In particular, our problem has 
at most two extremal configurations (in some cases a clique and the complement of a clique can give 
the same value of ip). 



4 Stability analysis 



In this section we discuss the proof of Theorem 1.2 In essentially the same way that Theorem 3.1 



implies Theorem 1.1 this theorem follows from a stability version of Theorem 3.1 



Theorem 4.1. Let r, s > 3 be integers and let a, b > be real. For every e > 0, there exists 5 > 
and an integer N such that every n-vertex G with n > N for which 

/(G) >m&x{a-a s ,b-p r }-5 



is e-close to some graph in Q n . Here f,a and (3 are as in Theorem 3.1 



Proof. If G is a threshold graph, the claim follows easily from Lemma 3.2 
graph, f(G) 



Since G is a threshold 



y(x, y) + o(l) for some (x, y) £ Wk and some integer k. As this lemma shows, the 
continuous function (p attains its maximum on the compact set Wfc at most twice, and this in points 
that correspond to graphs from Q n . Since f(G) is 5-close to the maximum, it follows that (x, y) must 
be e'-close to at least one of the two optimal points in W^. This, in turn implies e-proximity of the 
corresponding graphs. 

For the general case, we use the stability version of the Kruskal-Katona theorem due to Keevash 
[9]. Suppose G is a large graph such that f(G) > max{a • a s , b ■ f3 r } — 5. Let G\ be the shifted graph 
obtained from G. Thus G\ is a threshold graph with the same edge density as G, and f{Gi) > /(G) 
by Corollary 2.2 Pick a small e' > 0. We just saw that for 5 sufficiently small, G\ is e'-close to 
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Gmax £ Qn- As we know, either G max = Q n ^t or G max = Q n j for some < t < n. We deal with the 
former case, and the second case can be done similarly. Now \d{K2\ G)—d(K2', G m ax)\ < e', since G and 
Gi have the same edge density. Moreover, d(K s ; G) > d(K s ; G max )—5/a, because f(G) > f(G max )—5. 
Since G max is a clique, it satisfies the Kruskal-Katona inequality with equality. Consequently G has 
nearly the maximum possible X s -density for a given number of edges. By choosing e' and 5 small 
enough and applying Keevash's stability version of Kruskal-Katona inequality, we conclude that G 
and G max are e-close. □ 



5 Second proof 

In this section we briefly present the main ingredients for an alternative approach to Theorem 
We restrict ourselves to the case r = s. This proof reduces the problem to a question in the calculus 
of variations. Such calculations occur often in the context of shifted graphs. 

Let G be a shifted graph with vertex set [n] with the standard order. Then, there is some 
n > i > 1 such that A = {1, i} spans a clique, whereas B = {i + 1, n} spans an independent set. 
In addition, there is some non-increasing function F : A — > B such that for every j S A the highest 
index neighbor of j in B is F(j). Let x be a relative size of A and 1 — x relative size of B. In this 
case we can express (up to a negligible error term) 



fe-l 



l<j<xn 

fe-l 



= (l-xf + b(l-,)» ^ -(^"k^T) 

^ nx \ (1 - x)n I 

l<j<xn v v ' 7 

Let / be a non-increasing function / : [0, 1] — > [0, 1] such that f(t) = F ^il~y^ for every < t < ^ 
(Think of / as a relative version of F both on its domain with respect to A and its codomain with 
respect to B). Then we can express d(Kk\ G) in terms of x and / 

d(K k ; G) = (1 - x) fc + kx(l - xf- 1 j (1 - f{t)f- l dt = d(K~ k ; G xJ ). 

Jo 

Similarly one can show that 

d(K k ; G) = x k + kx k - l (l - x) [ (k - l)t k - 2 f(t)dt = d(K k ; G xJ ). 

Jo 

Note that in this notation, x = 9, f = (resp. x = 1 — 9, f = 1) corresponds to Q n ,6-ni (resp. Q n g. n ). 

To prove Theorem |1.1| for the case r = s = k, we show that assuming d(K k ;G x j) > a, the 
maximum of d(K k ;G x j) is attained for either / = or / = 1. For this purpose, we prove upper 
bounds on the integrals. 

Lemma 5.1. Iff : [0,1] — > [0,1] is a non-increasing function, then 



„i \ fc-i ' 

(1 - f{t)) k ~ l dt < max { 1 - ( / (jfe - l)t k - 2 f(t)dt )"",(!-/ (jfc - l)t k - 2 f{t)dt j 



o 
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The bounds in Lemma |5.1| are tight. Equality with the first term holds for / that takes only the 
values 1 and 0, and equality with the second term occurs for / a constant function. Proving Theorem 
|1.1| for such functions is done using rather standard (if somehow tedious) calculations. Lemma 5.1 
itself is reduced to the following lemma through a simple affine transformation and normalization. 

What non-decreasing function in [0, 1] minimizes the inner product with a given monomial? 



Lemma 5.2. Let g : [0, 1] — > [0, B] be a non- decreasing function with B > 1 and \\g\\k-i = 1. Then 

((k - l)t k -\g) = J\k - l)t k - 2 g(t)dt > min js ^1 - (l - ^ 
Equality with the first term holds for 

git)- 

The second equality holds for g = 1. 

We omit the proof which is based on standard calculations and convexity arguments. 




6 Shifting in hypergraphs 

In this section, we will discuss a possible extension of Lemma |2.1| to hypergraphs. Consider two set 
systems J-\ and Ti with vertex sets V\ and V2 respectively. A (not necessarily induced) labeled copy of 
T\ in J-2 is an injection I : V\ — > V2 such that 1(F) £ T2 for every F £ T\. We denote by Cop(J r i; F2) 
the set of all labeled copies of T\ in T2 and let 

t(T x \Ti) := |Cop(J-i;J- 2 )|. 

Recall that a vertex u dominates vertex v if S V ^ U (J-) = T . If either u dominates v or v dominates u 
in a family J 7 , we call the pair {u, v} stable in T . If every pair is stable in J 7 , then we call T a stable 
set system. 

Theorem 6.1. Let % be a stable set system and let J- be a set system. For every two vertices u,v of 
T there holds 

t(H;S u ^ v (F))>t(H;F). 
Corollary 6.2. Let G be an arbitrary graph and let H be a threshold graph H . Then 

t{H-S u ^ v {G))>t{H-G), 

for every two vertices u,v of G. 



Proof of Theorem 6.1 (sketch). We define a new shifting operator S u ^ v for sets of labeled copies. 
First, for every u,v £ V, and a labeled copy I : U — > V, define L u ^ v : U V by 

L(w) if L(w) 7^ u, v, 
I u ^ v (w) = < v if L(w) = u, 
u if L(w ) = v 
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For X a set of labeled copies, / E I, we let 




if I u -h-v £ X and Im(J) Pi {u, v} = {u}, 

if I u ^ v X, {u, v} C Im(J), and I _1 (u) dominates I (v) in 
otherwise. 



Finally, let S u -y v (T) : 



{S U -> V (I,X) : I € X}. Clearly, |5 u _s>^(X)| = |X|, and we prove that 



thereby proving that t(H; S U ^ V {F)) > t(H; J r ). As often in shifting, the proof is done by careful case 



7 Concluding remarks 

In this paper, we study the relation between the densities of cliques and independent sets in a graph. 
We show that if the density of independent sets of size r is fixed, the maximum density of s-cliques 
is achieved when the graph itself is either a clique on a subset of the vertices, or a complement of a 
clique. On the other hand, the problem of minimizing the clique density seems much harder and has 
quite different extremal graphs for various values of r and s (at least when a = 0, see [31 112j). 

Question 7.1. Given that d(K r ;G) = a for some integer r > 2 and real a S [0,1], which graphs 
minimize d(K s ; G) ? 

In particular, when a = we ask for the least possible density of s-cliques in graphs with inde- 
pendence number r — 1. This is a fifty years old question of Erdos, which is still widely open. Das et 
al [3], and independently Pikhurko [12], solved this problem for certain values of r and s. It would be 
interesting if one can describe how the extremal graph changes as a goes from to 1 in these cases. 
As mentioned in the introduction, the problem of minimizing d(K s ; G) in graphs with fixed density 
of r-cliques for r < s is also open and so far solved only when r = 2. 
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A Second proof in details 



We turn to desribe in details the second proof of Theorem |1.1| for fc-cliques and fc-anticliques, which 
was sketched in Section [5] Some of the purely technical parts of this proof are collected together in 
Lemma A. 2 First, we derive the proof of the theorem from Lemma |5.1| 
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Proof. Let p G [0,1], and q be the unique root of q k + kq k l (l — q) = p in [0,1]. We need to 
show that every x G [0,1] and non-increasing / : [0,1] — > [0,1] with d(Kk]G x j) > p satisfy that 
d(K k ;G x j) < $jfe(p), where 

$*(p) := M fe , fc , p = max{(l -p 1 /*)* + fc/A(i - p 1 /*)*" 1 , (1 - g) fc }. 

Namely, that d{K k ] G x j) is maximized when either / = or / = 1. 

By Lemma |5.1[ we can assume that / is either constant or that it only takes the values 1 and 0. 
Consider the latter, and for some y G [0, 1], let 



/(*) 



1 t<l-y 
t>l-y 



For convenience, we prove the statement for Gi— X f, thus we need to prove that 

max (1 - xf + kx(l - x)*" 1 ^ - yf' 1 s.t. 
x fc + fcc fc-1 (l - x)y >p 

is attained when either x = q, y = 1 or x = p 1 '^, y = 0. By monotonicity of d(K k ;G x j) and 
d(Kk; G x j), we assume that x fc + fcr fc_1 (l — x)y = p, hence a; G [q , ,p 1//fc ] and 

h 

p — X 

y 



kx k 1 (1 — x) 
We rewrite the objective function as 

/ k 

K{x) = (1 - x) k + kx{\ - xf' 1 ( 1 P ~ X 



kx k 1 (1 — x) 



k-l 



This part of the proof is completed in lemma A. 2 below, where it is shown (among other things) that 
the maximum of this function in the interval [(JSP 1 ^] occurs at an endpoint. 

We can now deal with the case of a constant /. First, note that &k{^k(p)) = P f° r every p G [0, 1]. 
Indeed, the curves (p, (1 - p 1 ' k ) k + kp l / k {\ - p 1 /*)*- 1 ) and (p, (1 - q) k ) are reflections of each other 
with respect to the line y = x. Therefore is symmertic with respect to this line. Let / = y be a 
constant function, and suppose that d{K^;G x j) = x k + /cx fc_1 (l — x)y > 3>fc(p). Then, as we have 
just shown, 

d(K k ; G xJ ) = (1 - x) k + kx{\ - x) k -\l - yf- 1 < $ fe ($ fe (p)) = p 
and the proof is completed. □ 

We now turn to deduce Lemma I5JJ from Lemma [ 



Proof of Lemma 5.1. Let / : [0,1] — > [0,1] be non-increasing and let h(t) = 1 — f(t). Then g(t) := 
fr(t)/||/t||fc_i is /fc_i-normalized and non-decreasing. We apply Lemma 5.2 with B = l/\\h\\k-i to 
conclude that 

((fc-i)t fc - 2 ,/*K mi f i (l . m k-i f -i) A 
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which we rewrite as 

((k - l)t k ~\ h) > min {l - (1 - \\h\\ k k zl) k -\ IHU-i} . 

In the first case, 

(i-WhWlzlY' 1 >i-((k-i)t k - 2 ,h) 

Since h = 1 — / this becomes 

(l " j\l " fW^dt) > 1 - J\k - l)t k ~ 2 (l - f{t))dt = f\k - l)t k - 2 f(t)dt 



which implies, 

Otherwise, 
which implies, 



(1 - f(t)) k - l dt <l-^j\k- l)t k - 2 f(t)dt" 



{{k-i)t k -\hy i >„,. llfc _ 1! 

(i - f\ k - i)t k - 2 f(t)dt^ > J\i - mf-^t. 



□ 

It only remains to prove Lemma |5.2| By a standard density argument it suffices to prove it for 
step functions, which we do in the following claim by induction on the number of steps. 

Claim A.l. Let g : [0,1] — > [0,B] be an non- decreasing step function with n > 2 steps. Namely, 
there is a partition X = (xq = < x\ < . . . < x n -\ < x n = 1) and real numbers = To < T\ < ... < 
T n < T n+ i = B such that 

V* 9 \[ Xi - 1 , Xi ]= T i- 

Suppose further that \\g\\k-i = 1. Let 1 < i < n— 1. Fix the partition X and all the Tj, except possibly 
Ti,Ti + i subject to the condition that the modified function is non- decreasing and has lk-i norm 1. 
Then (g, {k — l)t k ~ 2 ) is minimized when either T_i = T, T = T + i or T + i = T + 2- 

Proof. We need to solve the following optimization problem. 

Minimize (a^ -1 - x k Zl)U + {x\~l - x^ 1 )^, subject to (1) 
T_i < ti < ti + i < T + 2 (2) 

• k— 1 i j.k— 1 rrrfc — 1 | rrik — l / n \ 

where [ii = Xi — Xi-\. By Equation ^ 



H+l 



r^ + f^^-r 1 ))" 1 - (J) 
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The inequalities ti < ti + \ < Tj + 2 in ^ yield 



i 

k— 1 i ,. rpfc— 1\ fe — 1 



and 



i 

fc-i 



u > ^ max{0, ^Tt l + /Wgg ~ j . (6) 

Therefore, we can restate our problem as the following optimization problem 

Minimize (xf" 1 - afc 1 )** + (xfe? " + ^(^^ - t?" 1 ) 

subject to ([5]), (|6]) and t, > Tj_i 

This problem is feasible since ti = Ti is a valid solution, and therefore the domain is an interval. 
The objective function is well defined in this segment due to ^ and is concave. This is because it 
is a positive linear combination of a linear function and a concave function of the form (a — bx n )™ . 
Therefore, it is minimized at an endpoint of the interval, which corresponds precisely to ti = Tj_i, 
U = ti + i or ti+i = Ti + 2- 

□ 



We now turn to the proof of lemma |5.2 



Proof of Lemma 5.2, Let g : [0, 1] — > [0, B], with Zfc_i-norm 1, be a non-decreasing n-step function. 
Case 1. If n = 1 then g is constant, therefore g = 1 and (g, (k — l)t k ~ 2 } = 1. 
Case 2. If n = 2, we use claim A.l for i = 1 to conclude that Ti = or T2 = B. 
Case 2.1. If Ti = 0, then 



T^" 1 • (1 -xi) = 1 =>- xi = 1 



1 



T- 



2 



and therefore 



\ fc-i N 

1 



(g,(k-l)t^)=T 2 I 1-1-^1 j > 



min < £M 1 



1 



2 / 



The last concavity inequality is shown in lemma A. 2 
Case 2.2. If T 2 = B, then 
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hence 1 — Bk 1 _ 1 < x\ < 1, and 

(g, (k - l)t k ~ 2 ) = x^T, + (1 - x k x - l )B = 
sf" 1 "^ 1 [B k - l Xl - B k ~ l + l) - Bx k - 1 + B. 
By lemma A. 2 this is a concave function of x%, and is therefore minimized either for x\ = 1, where 



g is a 1-step function, or for x\ = l— R k-i ? where Ti = which returns us to case (2.1). 



Case 3. If n = 3, we use claim A.l for i = 1,2, and obtain a 3-step function with Xi = and T3 = i?. 
Therefore, 



B k - l x 2 - B k ~ l + n 



(X 2 " Xl)^" 1 + (1 " Zs)^" 1 = 1 

\ 22 — £1 

Since < T 2 < -B we conclude that < xi < 1 — gfc 1 _ 1 and 1 — Bk 1 _ 1 < £2 < 1. Furthermore, 

( 5 , (fc - l)t k ~ 2 ) = (xt 1 - x^ 1 )^ + (1 - xt^B = 
— (B k - 1 x 2 - B k - 1 + 1) + B - Bx k 2 x 



(X2— Xl) k ~ 



By lemma 



A.2 



for fixed x 2 > 1 — B l-i , this is minimized when either x\ = 0, which is precisely case 
(2.2), or when x\ = l— B l_ x , implying that T 2 = T3 = B, which brings us back to case 2. 
Case 4. If n > 4, we apply claim A.l| for % = 2 and reduce the number of steps without increasing 
(<?, (A; — l)i fc ~ 2 ). This completes the proof by induction. 

□ 

We finally provide a proof for the following technical lemma. 
Lemma A.2. 1. Let a £ [0, 1] and Cfc(a) the root of x k + /cx fc_1 (l — x) = a in [0, 1]. The function 



i<T(x) = (1 - xf + fc(l - xf-'x 1 - , 
has no local maximum in (cfc(a),a). 

2. Letf(x) = x(l-(l-^) r ), r > 2 an integer, B>1 andx G [1,-B], then fix) > min{/(l), /(J3)}. 

5. T/ie function 

g ( x ) = x r-l (B r x - B r + l)r - Bx r + 5 
is concave in (1 — -gjr, 1), where r > 2 an integer. 

4- The function h(x) = a - r ~ xr 1 has no local minimum in (0, a). Hence, for fixed x 2 > 1 — 

(a— s)? 
fe-1 fe-1 

^ ( 5 fc_1 S2 - B k ~ l + l) ^ + B — Bx\~ x 



(x 2 — Xl) k ~ 1 

is minimized at an endpoint of the interval < x\ < 1 



B k-i ■ 
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Proof. 1. We define 



„, . , a fc -x fc x k + kx k 1 (l-x)-a k 
Z = Z(x) = 1 — — 



A;x fc ~ 1 (l — or) &x fe_1 (l 



Note that Z varies between and 1 as x ranges over the interval [cfc(a), a] and in particular Z(ck(a)) 
and Z(a) = 1. Also, 

logZ = log(x fe + fcx fe_1 (l - x) - a fc ) - log A; - (k - l)logx - log(l - x). 

Therefore, 

Z' _ fc(fc - l)x fe - 2 (l - x) k-l 1 k-l k-1 1 



+ 



Z x k + kx k l (l — x)—a k x 1 — x xZ x 1 — x 
and 

z , = k-l _ ^_ = (kx-k + l)Z + (k-l)(l-x) 
x 1 — x x(l — x) 

We need to show that K{x) = (1 — x) k + kx(l — x) k ~ 1 Z k ~ 1 has no local maximum in the interval 
[c(a),a\. 

K'(x) = -k(l - xf- 1 + k(l - x) k ~ 1 Z k - 1 - k{k - l)x{l - x) k ~ 2 Z k - 1 + 
k{k - l)x(l - x) k - x Z k - 2 Z' = 
-k{l - xf- 2 (l-x + {kx- l)Z k ~ l - (jfe - l)Z k ' 2 {(kx - k + 1)Z + (jfe - 1)(1 - x))) = 

-k(l - x)^ 1 (k(k - 2)Z k - 1 - (jfe - l) 2 Z fc - 2 + l) . 

Suppose xo £ (c(a), a) is a critical point, we claim that Z(xq) < 1 — \. Indeed, Z(xo) is a root of the 
polynomial q(z) = k(k - 2)z k ~ l — {k— l) 2 z k ~ 2 + 1. Since 

q'( z ) = (jfe - 2)(ife - l)/- 3 (fez -k + 1), 

q is increasing in [1 — |, 1] and therefore for every z £ [1 — |, 1), < = 0. Hence q has no 
roots in this interval, which therefore does not contain Z(xq). 
Now, we compute the second derivative of K2 in xo, 

KZ(x ) = -k(l - xo)^ 1 • (k - l)(k - 2)Z k - 3 (x )Z'(x )(kZ(x ) - k + 1) > 

since Z > 0, Z' > 0, and (kZ(xo) — k + 1) < 0. This proves that xo is not a local maximum. 

2. It is sufficient to see that /i(x) = /(^) has no local minimum in (0, 1). Indeed, 

x 

Therefore, 

, r 2 x r (l - x r ) r_1 + (1 - x r ) r - 1 

Ii\ x ) — 9 

x z 
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Denote y(x) = 1 — x r and q(y) = r 2 (l — y)y r 1 + y r — 1. Note that 

</(y) = ~ r ( r - i)y r-2 ((i + r )y - r), 

which implies that q is decreasing if y € [^j, 1], therefore (/ has no roots in this interval as q(y) > 
g(l) = 0. Consequently, if xq < 1 is a critical point of fi, then y(xo) < ^j- The numerator of the 
second derivative of /i in xo is 

g'(y(x )) • y'(*o) = r\r - \)x^ x y{x^-\{\ + r)y - r) < 0, 

and /i does not have a local minimum. 

3. Consider the change of variables y = B r x — B r + 1 and denote a = B r — 1. Then, 

sw = — j^zi Br 2_ x + B i ye (0,1), a>o 

We need to show that for every a > 0, the function 

r 2 — 1 1 

G(y) = (y + a)^^yr - (y + a) r 

is concave in (0, 1). Indeed, 

nil t \ (r 2 -l)(r 2 -r-l) / . ,r 2 -2r-i 1 

g (y) = 1 — -(y + «) r y + 

„ r 2i r 2 -r-l 1-r r 2 - 1 l-2r 

2^2-(y + a) »■ y - +-pr{y + a) r y r - 
r(r - l)(y + a) r ~ 2 . 

We need to prove that G" < 0. At a = 0, 

r— 2 

= ^-2- ((r 2 - l)(r 2 - r - 1) + 2(r 2 - 1) + 1 - r - r 3 (r - 1)) = 0. 
If a > 0, let c = | > 0. G"(y) can be written as 

r—2 t 

a , , r z ~2r-\ l-2r , { Q o •} , > 1 2r-l \ 

-^-(c+1) F c — ( r _ l) + 2rc-l-r i (l + c)rc~J . 



We need to prove that 

r 3 c 2 + 2rc - 1 < r 3 c 2 ^1 + ^ *" 
We multiply by ^ and let z = £ to rewrite this as 

r 3 + 2rz - z 2 < r 3 (l + z)r 
For 2 = this holds as equality, so it is sufficient to prove the inequality for the derivatives, 

2r -2z <r 2 (l + zf^ . 
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For z > r, this inequality holds as 2r — 2z < < r 2 (l + z) — . For < z < r we can raise both sides 
to the r-th power, 

2 r \r - zf \\ + z)'- 1 < r 2r 

This holds for z = and z = r, so we only need to check that it holds for critical points. If z G (0, r), 
then, 

d((r-z) r (l + z) r - 1 ) _ r 2 - 2r 

(fe ~ ^ * ~ 2r - 1 ' 

hence, for < z < r, 

r-1 /orY r + r \ 



2 r (r-z) r (l + z) r ~ i < 2 
Consequently, it is enough to prove that 



2r - 1 J V 2r - 1 



l 2r - 1 / \2r-l 



which we write as. 

/2r-iy r_1 /2r 2 -r\ r_ ^ / \/2r- - r 
1 -V27^2j V(r + l) 2 J ^ 2^ J 

For r > 4, each of these three terms is greater than 1, and for r = 2, 3 it can be verified by assignment. 

4. Let r r 

h( \ a x 
h(x) = r . 

(a — x) r 

Then, 

(r 2 — l)x r — r 2 ax r ~ l + a r 



h'(x) 



r{a — x) r 



If < xq < a is a critical point, then it is a root of the polynomial q{x) = (r 2 — l)x r — r 2 ax r ~ 1 + a r . 
Note that q'(x) = r(r — l)x r_2 ((r + l)x — ra), and therefore positive for x £ [fqpi a > a ]- Hence, for 
such x, q(x) > q(a) = 0, and therefore xq < ^rjti- The numerator of the second derivative of h in xo 
equals to q'(xo) which is thereby negative. Hence, xq is not a local minimum. □ 

B Details of shifting in hypergraphs 

In this section we describe the full proof of Theorem 6.1 which was sketched in Section [6j Theorem 



6.1 states that the number of labeled copies of a stable set system H in an arbitrary set system J- 



does not decrease after the shifting, i.e., 



23 



Proof of Theorem 6. 1 , Let us recall the definition of the shifting operator S u -^ v for sets of labeled 
copies. For X a set of labeled copies, we defined S U ^ V (X) = {S U ^ V (I ,X) : I G X}, where 



Su— vv\I — 
and I u ++v : U V was defined by 

I u ^ v (w) 



I u ^v if Iu^v X and Im(J) n {«, = {u}, 

i u ou if lu^u X, {it, C Im(i), and 7 _1 (u) dominates i _1 (v) in %, 
I otherwise, 



I(w) if I(w) 7^ u, u , 
v if i(n>) = 
u if /fiu) = w. 



Henceforth, let X := Cop(%; X") be the family of all labeled copies of H in X", let X"' = S U ^. V (P) be 
the shifted set system and let X' = S U ^ V (X) := {S U ^ V (I ,X) : I G X} be the set of all labeled copies 
after the shifting. Clearly |X'| = |X|, thus in order to show t(H;P) > t(H;P), it suffices to prove 
that 

S U -^(I) C Cop(^;^'). 

Let I G X be an arbitrary labeled copy of %, and let I' = S U ^ V (1 ,1). For any H € T~L, it is true that 
I(if) G X". Let us now show, by careful case analysis, that I'(H) G X"' for all H G thereby proving 
that I' G Cop(%;X"'). One of the following is true 

(i) Im(i') n {u,v} = 0: Clearly / = I'. On the other hand, since 1(H) n {u, v) = 0, it is true that 
S U ^ V (I(H);P) = 1(H), hence 1(H) G P. Thus I' (if) G P. 

(ii) Im(i') n {u, v} = {u}: In this case, the equality I' = I holds. But S U ^ V (I,X) = I implies that 
Iu++ V G X, and thus it is true that I U <^ V (H) G X". It becomes clear that S U ^ V (I(H); P) = 1(H) 
since I U ^ V (H) G X", hence i(ii) G X"'. Thus I'(H) G X"'. 

(iii) Im(J') n {u, v} = {v}: Let us now analyze two subcases: 

(a) I = I': This is the easy subcase. Since u G" 1(H), it is true that S U -> V (I(H); X") = 1(H), 
hence /(F) G P. We conclude that i'(iT) G P. 

(b) / = / w Let X = S U -> V (I(H),P). UX = I U ^ V (H) then clearly J'(ff) = I U ^ V (H) G 
X"'. U X ^ I U ^ V (H), it must be true that X = I (if) and i(if) n {u, t>} = {u}. But 
S U ^ V (I(H),P) = 1(H) only if i^„(if) G X". Hence I U ^ V (H) = S U ^ V (I U „ V (H),P) G P, 
because v G I U ^ V (H). 

(iv) Im(i') n {u, f} = {u, v}: Dividing into three more subcases: 

(a) I u ^>v G X: We have I' = I. Clearly S n -> V (I(H),P) = 1(H) because I U ^ V (H) G X", hence 
I'(H) = 1(H) G P. 

(b) I u ^v X but I (u) does not dominate I^(v) in %: Again, it is true that I' = I. 
Moreover I~ l (v) must dominate I~ l (u) in H, because Ti is stable. We will now show that 
S U ^ V (I(H),F) = 1(H). Suppose, towards contradiction, that S U ^ V (I(H), P) ^ 1(H). 



24 



This can only happen when 1(H) D {u,v} = {u} and I U ^ V (H) F. Let 
H' = (H \J {I~ 1 (v)})\{I~ l (u)}. Because dominates I~ l (u) in 7{, we must have 

Sj-ir u \^.j-if v \(H,'H) = H, hence H' G "H. But this is a contradiction because I U ++ V (H) = 
I(H') and /(#') G J - since #' G U. Therefore 1(H) = S U ^ V (I(H),F) G F' . 

(c) I u ^v ^1- & n d / dominates / ^(v) in The identity I 1 = I u ^ v holds. If 

S U ^ V (I(H),F) = Iu^v(H), then clearly I'(H) G J 7 . If S U -> V (I(H), F) ± I U ^ V (H), then it 
is either because 

i. I(H)t~){u, v} = {u} and I U ^ V (H) G F: It is true that J uo „ (#) = S U ^ V (I U ++ V (H),F) G 
J 7 '; or because 

ii. /(#) n = {f}: Let H' = (H U {I -1 ^)}) \ Since dominates 
/-» in clearly H' G Thus /(#') G F, hence I U ^ V (H) = I(H') G J". Therefore 
lu^v(H) = S U -> V (I U ^ V (H), F) G J 7 '; 

In all cases we have I'(H) G F' , therefore I' G Cop(%; F'), which implies F') > t(H; F). □ 
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